AITopics | Tamaulipas

Collaborating Authors

Tamaulipas

SAGRAD: A Program for Neural Network Training with Simulated Annealing and the Conjugate Gradient Method

arXiv.org Artificial IntelligenceJan-31-2025

SAGRAD (Simulated Annealing GRADient), a Fortran 77 program for computing neural networks for classification using batch learning, is discussed. Neural network training in SAGRAD is based on a combination of simulated annealing and M{\o}ller's scaled conjugate gradient algorithm, the latter a variation of the traditional conjugate gradient method, better suited for the nonquadratic nature of neural networks. Different aspects of the implementation of the training process in SAGRAD are discussed, such as the efficient computation of gradients and multiplication of vectors by Hessian matrices that are required by M{\o}ller's algorithm; the (re)initialization of weights with simulated annealing required to (re)start M{\o}ller's algorithm the first time and each time thereafter that it shows insufficient progress in reaching a possibly local minimum; and the use of simulated annealing when M{\o}ller's algorithm, after possibly making considerable progress, becomes stuck at a local minimum or flat area of weight space. Outlines of the scaled conjugate gradient algorithm, the simulated annealing procedure and the training process used in SAGRAD are presented together with results from running SAGRAD on two examples of training data.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.6028/jres.120.009

2502.00112

Country:

North America > United States > Oklahoma > Payne County > Cushing (0.04)
North America > Mexico > Tamaulipas (0.04)
Europe > Denmark (0.04)
(3 more...)

Genre:

Workflow (0.70)
Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models

Cheng, Sitao, Pan, Liangming, Yin, Xunjian, Wang, Xinyi, Wang, William Yang

arXiv.org Artificial IntelligenceOct-10-2024

Large language models (LLMs) encode vast amounts of knowledge during pre-training (parametric knowledge, or PK) and can further be enhanced by incorporating contextual knowledge (CK). Can LLMs effectively integrate their internal PK with external CK to solve complex problems? In this paper, we investigate the dynamic interaction between PK and CK, categorizing their relationships into four types: Supportive, Complementary, Conflicting, and Irrelevant. To support this investigation, we introduce ECHOQA, a benchmark spanning scientific, factual, and commonsense knowledge. Our results show that LLMs tend to suppress their PK when contextual information is available, even when it is complementary or irrelevant. While tailored instructions can encourage LLMs to rely more on their PK, they still struggle to fully leverage it. These findings reveal a key vulnerability in LLMs, raising concerns about their reliability in knowledge-intensive tasks. Resources are available at https://github.com/sitaocheng/Knowledge_Interplay

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2410.08414

Country:

Asia > Singapore (0.04)
Asia > Indonesia > Bali (0.04)
Asia > Thailand > Bangkok > Bangkok (0.04)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Media (1.00)
Leisure & Entertainment > Sports (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Fourier Series Guided Design of Quantum Convolutional Neural Networks for Enhanced Time Series Forecasting

Osorio, Sandra Leticia Juárez, Ruiz, Mayra Alejandra Rivera, Mendez-Vazquez, Andres, Rodriguez-Tello, Eduardo

arXiv.org Artificial IntelligenceApr-24-2024

In this study, we apply 1D quantum convolution to address the task of time series forecasting. By encoding multiple points into the quantum circuit to predict subsequent data, each point becomes a feature, transforming the problem into a multidimensional one. Building on theoretical foundations from prior research, which demonstrated that Variational Quantum Circuits (VQCs) can be expressed as multidimensional Fourier series, we explore the capabilities of different architectures and ansatz. This analysis considers the concepts of circuit expressibility and the presence of barren plateaus. Analyzing the problem within the framework of the Fourier series enabled the design of an architecture that incorporates data reuploading, resulting in enhanced performance. Rather than a strict requirement for the number of free parameters to exceed the degrees of freedom of the Fourier series, our findings suggest that even a limited number of parameters can produce Fourier functions of higher degrees. This highlights the remarkable expressive power of quantum circuits. This observation is also significant in reducing training times. The ansatz with greater expressibility and number of non-zero Fourier coefficients consistently delivers favorable results across different scenarios, with performance metrics improving as the number of qubits increases.

architecture, coefficient, qubit, (15 more...)

arXiv.org Artificial Intelligence

2404.15377

Country:

North America > Mexico > Tamaulipas (0.04)
North America > Mexico > Jalisco (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.94)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

Add feedback

Unlock the Future of Autonomous Drones with Innovative Secure Runtime Assurance (SRTA)

IEEE Spectrum RoboticsApr-18-2024, 15:52:55 GMT

By submitting this content request, I have legitimate interest in the content and agree that Technology Innovation Institute, their partners, and the creators of any other content I have selected may contact me regarding news, products, and services that may be of interest to me. By submitting this content request, I have legitimate interest in the content and agree that Technology Innovation Institute, their partners, and the creators of any other content I have selected may contact me regarding news, products, and services that may be of interest to me. I agree to the IEEE Privacy Policy Are you an IEEE member?

autonomous drone, information, innovative secure runtime assurance, (9 more...)

IEEE Spectrum Robotics

Country:

Oceania > Australia > Australian Indian Ocean Territories > Territory of Cocos (Keeling) Islands (0.15)
Asia > China > Hong Kong (0.15)
Oceania > Samoa (0.07)
(285 more...)

Industry:

Health & Medicine (0.49)
Consumer Products & Services (0.49)
Government (0.31)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.43)

Add feedback

Explainable Multi-hop Question Generation: An End-to-End Approach without Intermediate Question Labeling

Hwang, Seonjeong, Kim, Yunsu, Lee, Gary Geunbae

arXiv.org Artificial IntelligenceMar-31-2024

In response to the increasing use of interactive artificial intelligence, the demand for the capacity to handle complex questions has increased. Multi-hop question generation aims to generate complex questions that requires multi-step reasoning over several documents. Previous studies have predominantly utilized end-to-end models, wherein questions are decoded based on the representation of context documents. However, these approaches lack the ability to explain the reasoning process behind the generated multi-hop questions. Additionally, the question rewriting approach, which incrementally increases the question complexity, also has limitations due to the requirement of labeling data for intermediate-stage questions. In this paper, we introduce an end-to-end question rewriting model that increases question complexity through sequential rewriting. The proposed model has the advantage of training with only the final multi-hop questions, without intermediate questions. Experimental results demonstrate the effectiveness of our model in generating complex questions, particularly 3- and 4-hop questions, which are appropriately paired with input answers. We also prove that our model logically and incrementally increases the complexity of questions, and the generated multi-hop questions are also beneficial for training question answering models.

complexity, multi-hop question, proceedings, (16 more...)

arXiv.org Artificial Intelligence

2404.00571

Country:

North America > United States > California > Sacramento County > Sacramento (0.14)
North America > Mexico > Tamaulipas > Nuevo Laredo (0.05)
North America > United States > California > San Francisco County > San Francisco (0.04)
(4 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Education (0.93)
Government > Voting & Elections (0.93)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Ontology in Hybrid Intelligence: a concise literature review

Pileggi, Salvatore F.

arXiv.org Artificial IntelligenceOct-16-2023

In a context of constant evolution and proliferation of AI technology,Hybrid Intelligence is gaining popularity to refer a balanced coexistence between human and artificial intelligence. The term has been extensively used in the past two decades to define models of intelligence involving more than one technology. This paper aims to provide (i) a concise and focused overview of the adoption of Ontology in the broad context of Hybrid Intelligence regardless of its definition and (ii) a critical discussion on the possible role of Ontology to reduce the gap between human and artificial intelligence within hybrid intelligent systems. Beside the typical benefits provided by an effective use of ontologies, at a conceptual level, the conducted analysis has pointed out a significant contribution of Ontology to improve quality and accuracy, as well as a more specific role to enable extended interoperability, system engineering and explainable/transparent systems. Additionally, an application-oriented analysis has shown a significant role in present systems (70+% of the cases) and, potentially, in future systems. However, despite the relatively consistent number of papers on the topic, a proper holistic discussion on the establishment of the next generation of hybrid-intelligent environments with a balanced co-existence of human and artificial intelligence is fundamentally missed in literature. Last but not the least, there is currently a relatively low explicit focus on automatic reasoning and inference in hybrid intelligent systems.

hybrid intelligence, intelligence, ontology, (10 more...)

arXiv.org Artificial Intelligence

2303.17262

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Portugal > Porto > Porto (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(12 more...)

Genre: Overview (1.00)

Industry:

Information Technology > Smart Houses & Appliances (0.86)
Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Issues (1.00)

Add feedback

Adversarial Attacks Assessment of Salient Object Detection via Symbolic Learning

Olague, Gustavo, Pineda, Roberto, Ibarra-Vazquez, Gerardo, Olague, Matthieu, Martinez, Axel, Bakshi, Sambit, Vargas, Jonathan, Reducindo, Isnardo

arXiv.org Artificial IntelligenceSep-11-2023

Machine learning is at the center of mainstream technology and outperforms classical approaches to handcrafted feature design. Aside from its learning process for artificial feature extraction, it has an end-to-end paradigm from input to output, reaching outstandingly accurate results. However, security concerns about its robustness to malicious and imperceptible perturbations have drawn attention since its prediction can be changed entirely. Salient object detection is a research area where deep convolutional neural networks have proven effective but whose trustworthiness represents a significant issue requiring analysis and solutions to hackers' attacks. Brain programming is a kind of symbolic learning in the vein of good old-fashioned artificial intelligence. This work provides evidence that symbolic learning robustness is crucial in designing reliable visual attention systems since it can withstand even the most intense perturbations. We test this evolutionary computation methodology against several adversarial attacks and noise perturbations using standard databases and a real-world problem of a shorebird called the Snowy Plover portraying a visual attention task. We compare our methodology with five different deep learning approaches, proving that they do not match the symbolic paradigm regarding robustness. All neural networks suffer significant performance losses, while brain programming stands its ground and remains unaffected. Also, by studying the Snowy Plover, we remark on the importance of security in surveillance activities regarding wildlife protection and conservation.

artificial intelligence, database, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2309.059

Country:

North America > United States (0.93)
Asia > India (0.14)
North America > Mexico > Querétaro (0.14)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)
Energy > Oil & Gas (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

An approach based on Open Research Knowledge Graph for Knowledge Acquisition from scientific papers

Jiomekong, Azanzi, Tiwari, Sanju

arXiv.org Artificial IntelligenceAug-23-2023

A scientific paper can be divided into two major constructs which are Metadata and Full-body text. Metadata provides a brief overview of the paper while the Full-body text contains key-insights that can be valuable to fellow researchers. To retrieve metadata and key-insights from scientific papers, knowledge acquisition is a central activity. It consists of gathering, analyzing and organizing knowledge embedded in scientific papers in such a way that it can be used and reused whenever needed. Given the wealth of scientific literature, manual knowledge acquisition is a cumbersome task. Thus, computer-assisted and (semi-)automatic strategies are generally adopted. Our purpose in this research was two fold: curate Open Research Knowledge Graph (ORKG) with papers related to ontology learning and define an approach using ORKG as a computer-assisted tool to organize key-insights extracted from research papers. This approach was used to document the "epidemiological surveillance systems design and implementation" research problem and to prepare the related work of this paper. It is currently used to document "food information engineering", "Tabular data to Knowledge Graph Matching" and "Question Answering" research problems and "Neuro-symbolic AI" domain.

artificial intelligence, knowledge management, scientific paper, (19 more...)

arXiv.org Artificial Intelligence

2308.12981

Country:

North America > United States > California (0.04)
North America > Mexico > Tamaulipas (0.04)
Europe > Serbia > Central Serbia > Belgrade (0.04)
(2 more...)

Genre:

Workflow (1.00)
Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine > Public Health (1.00)
Health & Medicine > Epidemiology (1.00)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.92)

Add feedback

Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text

Mihindukulasooriya, Nandana, Tiwari, Sanju, Enguix, Carlos F., Lata, Kusum

arXiv.org Artificial IntelligenceAug-4-2023

The recent advances in large language models (LLM) and foundation models with emergent capabilities have been shown to improve the performance of many NLP tasks. LLMs and Knowledge Graphs (KG) can complement each other such that LLMs can be used for KG construction or completion while existing KGs can be used for different tasks such as making LLM outputs explainable or fact-checking in Neuro-Symbolic manner. In this paper, we present Text2KGBench, a benchmark to evaluate the capabilities of language models to generate KGs from natural language text guided by an ontology. Given an input ontology and a set of sentences, the task is to extract facts from the text while complying with the given ontology (concepts, relations, domain/range constraints) and being faithful to the input sentences. We provide two datasets (i) Wikidata-TekGen with 10 ontologies and 13,474 sentences and (ii) DBpedia-WebNLG with 19 ontologies and 4,860 sentences. We define seven evaluation metrics to measure fact extraction performance, ontology conformance, and hallucinations by LLMs. Furthermore, we provide results for two baseline models, Vicuna-13B and Alpaca-LoRA-13B using automatic prompt generation from test cases. The baseline results show that there is room for improvement using both Semantic Web and Natural Language Processing techniques.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2308.02357

Country:

Europe > Greece (0.04)
North America > Mexico > Tamaulipas (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
(5 more...)

Genre: Research Report (0.70)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text

Khorashadizadeh, Hanieh, Mihindukulasooriya, Nandana, Tiwari, Sanju, Groppe, Jinghua, Groppe, Sven

arXiv.org Artificial IntelligenceMay-15-2023

Knowledge graphs can represent information about the real-world using entities and their relations in a structured and semantically rich manner and they enable a variety of downstream applications such as question-answering, recommendation systems, semantic search, and advanced analytics. However, at the moment, building a knowledge graph involves a lot of manual effort and thus hinders their application in some situations and the automation of this process might benefit especially for small organizations. Automatically generating structured knowledge graphs from a large volume of natural language is still a challenging task and the research on sub-tasks such as named entity extraction, relation extraction, entity and relation linking, and knowledge graph construction aims to improve the state of the art of automatic construction and completion of knowledge graphs from text. The recent advancement of foundation models with billions of parameters trained in a self-supervised manner with large volumes of training data that can be adapted to a variety of downstream tasks has helped to demonstrate high performance on a large range of Natural Language Processing (NLP) tasks. In this context, one emerging paradigm is in-context learning where a language model is used as it is with a prompt that provides instructions and some examples to perform a task without changing the parameters of the model using traditional approaches such as fine-tuning. This way, no computing resources are needed for re-training/fine-tuning the models and the engineering effort is minimal. Thus, it would be beneficial to utilize such capabilities for generating knowledge graphs from text.

large language model, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2305.08804

Country:

Europe > Greece (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.99)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback